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ABSTRACT 

The  problem  of  securing  multicast  communications  in  an 
energy-constrained  ad-hoc  network  requires  the  efficient 
management  of  cryptographic  quantities.  We  show  that 
existing  efficient  key  distribution  techniques  for  wired 
networks  that  rely  on  logical  hierarchies  are  extremely 
energy  inefficient.  We  also  show  that  the  consideration  of 
the  physical  location  of  the  members  is  critical  for  devel¬ 
oping  energy-efficient  key  distribution  schemes.  By  ex¬ 
ploiting  the  spatial  correlation  between  the  members  of 
the  multicast  group,  we  construct  an  energy-aware  key 
distribution  scheme.  We  present  simulation  results  to 
illustrate  the  improvements  achieved  by  our  proposed 
algorithm. 

1.  INTRODUCTION 

When  an  identical  message  has  to  be  sent  to  multiple 
receivers,  multicast  communications  model  reduces  the 
sender  as  well  as  the  network  management  overhead. 
Many  applications  that  make  use  of  single-sender- 
multiple-receiver  communication  model  can  benefit  from 
multicast  mode.  In  order  to  secure  the  communication 
channel,  the  multicast  communication  should  be  en¬ 
crypted  [1],  The  use  of  symmetric  key  cryptography  al¬ 
lows  the  sender  to  perform  one  encryption  (in  broadcast 
mode)  and  every  user  to  perform  one  decryption  per  mes¬ 
sage,  thus  reducing  the  computational  and  communica¬ 
tion  overhead.  However,  the  use  of  a  single  key  known  to 
all  members,  requires  its  update  each  time  a  group  mem¬ 
ber  joins  or  leaves  the  group,  in  order  to  provide  back¬ 
ward  and  forward  traffic  protection.  Since  every  member 
holds  the  data  encryption  key  also  known  as  session  en¬ 
cryption  key  (SEK),  when  a  member  leaves  the  group,  a 
secure  channel  to  reach  the  remaining  valid  members  for 
the  update  of  the  SEK  is  required.  Hence,  the  group  has 
to  have  additional  keys  called  Key  Encrypting  Keys 
(KEKs)  [1], 

The  key  management  problem  is  to  ensure  that  only 
valid  members  hold  the  valid  keys  at  any  time.  The  key 


management  problem  can  be  reduced  to  the  key  distribu¬ 
tion  problem,  which  involves  the  secure  and  efficient  dis¬ 
tribution  of  the  SEK  and  the  KEKs  to  valid  members.  In 
case  of  wired  networks,  the  rooted  tree  based  hierarchical 
key  distribution  schemes  are  known  to  be  optimal  [1],  In 
[2],  these  results  were  directly  used  for  energy- 
constrained  sensor  networks.  However,  as  we  show  in  this 
paper,  such  models  are  not  energy-efficient.  We  present 
an  energy-aware  key  distribution  scheme  that  makes  use 
of  the  geographical  location  information  about  the  multi¬ 
cast  members,  for  achieving  efficient  key  distribution. 

2.  THE  AD-HOC  NETWORK  ENVIRONMENT 

We  assume  that  omni-directional  antennas  are  used  for 
transmission  and  reception  of  the  signal.  The  required 
power  Pd  for  reaching  a  receiver  at  a  distance  d  is  propor¬ 
tional  to  the  Y '  power  of  that  distance,  with  2  <  y  <  4  . 
Assuming  the  proportionality  constant  to  be  one,  we  have 
Pj=dr. 

We  now  demonstrate  how  transmission  power  (a 
quantity  defined  in  the  physical  layer),  affects  the  way  the 
routing  procedure  is  realized  at  the  network  layer.  The 
wireless  medium  along  with  the  omni-directional  anten¬ 
nas,  offer  the  unique  characteristic  of  the  broadcast  ad¬ 
vantage  [3],  In  Fig.  1(a),  sender  S  transmits  a  mes¬ 
sage  to  node 


Fig.  1.  (a)  Broadcast  advantage  for  members  Mi-M?.  (b)  S 
transmits  an  identical  message  to  both  receivers 

Mi,  all  nodes  that  lie  within  the  circle  of  radius  SM , 
receive  the  message  for  “free”. 

We  now  show  the  impact  of  this  physical  layer  prop¬ 
erty  on  the  routing  decision.  In  Fig.  1(b),  assume  that 
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d2>d\  and  that  the  sender  S  needs  to  transmit  an  identical 
message  to  nodes  M\  and  M2.  A  simple  strategy  would  be 
to  use  unicast  transmissions  requiring  a  total  energy  ex¬ 
penditure  of  (d\y+di;).  However,  the  broadcast  nature  of 
the  wireless  medium  can  reduce  this  expenditure  with  the 
use  of  one  of  the  two  following  strategies:  (a)  transmit  to 
Mi  and  let  M\  relay  the  message  to  M2.  (b)  transmit  to  M2 
and  let  M\  receive  the  message  for  free,  since  d2>d\  (due 
to  broadcast).  This  leads  to  the  following  rule:  if 
df>{d\+d\2)  then  the  sender  chooses  the  strategy  (a), 
otherwise  strategy  (b)  is  preferred.  Note  that  the  routing 
decision  relies  solely  on  the  nodes’  physical  location  if 
identical  path  loss  model  is  assumed. 

3.  IMPACT  OF  THE  PHYSICAL  LOCATION  INTO 
THE  KEY  DISTRIBUTION  SCHEME 

We  now  demonstrate  the  need  for  consideration  of  the 
geographical  location  information  in  the  construction  of 
the  key  distribution  scheme.  Such  information  can  be 
obtained  using  the  Global  Positioning  System  (GPS)  [4]. 
In  Fig.  2,  we  represent  a  wireless  network  of  7  nodes, 
with  one  of  them  being  the  sender,  also  known  as  the 
group  controller  ( GC ),  and  two  intermediate  nodes  R\,  R2 
relaying  traffic  to  four  receiving  nodes  MrM4,  which 
form  a  multicast  group.  The  energy  required  for  sending 
a  message  from  the  GC  to  the  two  relay  nodes  is  set  to 
one  unit  and  the  energy  required  for  sending  a  message 
from  the  relay  nodes  to  the  receiving  nodes  is  also  set  to 
one  unit.  Hence,  the  GC  needs  to  perform  only  one 
transmission  to  reach  R\,R2  and,  relay  nodes  R\,  R2  need 
to  perform  one  transmission  each  to  reach  {Mi,  M2}  and 
{M2,  M4},  respectively. 

Fig.  3  presents  two  different  key  distribution  strategies 
for  the  multicast  group  in  Fig.  2.  The  one  in  Fig.  3(a)  is 
built  according  to  the  available  geographical  location 
information  (close-by  members  are  placed  adjacently  into 
the  key  tree),  while  the  one  in  Fig.  3(b)  is  a  result  of  a 
random  placement  of  the  members  into  the  leaves  of  the 
tree  (logical  assignment).  All  the  nodes  share  the  key  K0. 


Fig.  2.  Routing  tree  of  an  energy  constrained  wireless  network 
Fig.  3.  (a)  A  hierarchical  tree  based  key  distribution  scheme 
based  on  geographical  location  information,  (b)  A  logical  hier¬ 
archical  tree  key  distribution  scheme. 

Let’s  assume  that  key  K0  has  been  compromised  and 
needs  to  be  replaced  by  the  new  keyKo’.  For  scheme  in 


Fig.  3(a),  the  GC  generates  encrypted  messages  [K 0 

and  {Kq']  and  transmits  them  to  relay  nodes  R\  and 

R2,  respectively.  Node  R\  performs  one  transmission  to 
Mi,  M2  and  R2  performs  one  transmission  to  M2,  M4.  The 
total  energy  expenditure  is  4  energy  units.  For  scheme  in 
Fig.  3(b),  the  GC  transmits  two  messages  to  both  Ru  R2. 
Both  R\  and  R2  need  to  transmit  twice  to  reach  nodes  Mi, 
M2  and  M2,  M4,  since  nodes  that  do  not  share  common 
keys  cannot  be  reached  with  a  single  transmission.  The 
scheme  in  Fig.  3(b)  requires  6  energy  units.  Hence,  for 
this  example,  the  consideration  of  the  physical  location 
information  in  the  realization  of  the  key  distribution 
scheme  leads  to  energy  savings  of  33. 

4.  ENERGY-AWARE  KEY  DISTRIBUTION  USING 
GEOGRAPHICAL  LOCATION  INFORMATION 

We  now  propose  an  energy-aware  key  distribution 
scheme,  which  uses  geographical  location  information. 
We  make  the  observation  that  members  that  are  spatially 
close  to  each  other  can  potentially  be  reached  with  broad¬ 
cast,  or  use  the  same  routing  paths  to  receive  data.  If  we 
represent  members  as  points  in  the  2-dimentional  plane, 
we  can  employ  a  clustering  algorithm  to  cluster  them  into 
groups  and  construct  a  hierarchical  key  tree  structure. 

Though  one  can  use  any  suitable  clustering  algorithm, 
we  have  developed  a  variant  of  K-means  (a  popular  algo¬ 
rithm  in  pattern  recognition  and  classification),  algorithm 
for  creating  appropriate  clusters.  K-means  algorithm  is 
used  due  to  its  ease  of  implementation  and  the  ability  to 
control  the  number  of  steps  in  which  it  terminates  [5]. 
The  goal  of  K-means  is  to  create  K  clusters  out  of  N 
points  ( K<N)  such  that  a  “loss”  cost  function  is  mini¬ 
mized  with  respect  to  a  dissimilarity  measure.  K-means 
uses  the  squared  Euclidean  distance  as  a  dissimilarity 
measure.  If  the  coordinates  of  point  i  are  Xi  =  (x\  \ ,  xl2),  the 
Euclidean  distance  is  equal  to: 

2 

dix^Xi  j  =  2j(^- -xvj)2  =  \\xi~Xi  ’I"  (1) 

j= i 

When  y  =  2,  the  Euclidean  distance  is  proportional  to  the 
transmission  power  and  is  suitable  for  our  clustering  pro¬ 
cedure.  In  case  where  y  ^  2,  we  can  modify  the  dissimi¬ 
larity  measure  to  be  proportional  to  the  transmission 
power  as  in  (2). 

d(xi,xi')  =  \\xi-xi'f  (2) 

In  our  analysis  we  will  focus,  without  loss  of  generality, 
on  the  case  where  y  =  2.  When  y  ^  2  instead  of  K-means, 
K-medoids  can  be  used  to  solve  the  optimization  problem 
at  the  expense  of  increased  computational  complexity  [5]. 
The  “loss”  function  that  is  minimized  in  K-means  is: 


^c)=jZZ  ZIMI 

Z  k= 1  C(i)=kC(i')=k 


=  Z  Z  \\Xi~X*t  (3) 

k= 1  C(i)=k 

where  C(i)  =  k  symbolizes  the  assignment  of  the  ith  mem¬ 
ber  to  the  k'h  cluster,  xk  =  (xlk,x2i )  is  the  mean  vector  of 

cluster  k  and  C  is  the  resulting  cluster  configuration  (/V 
members  assigned  into  K  clusters).  The  “loss”  function  is 
minimized  by  assigning  the  N  members  to  K  clusters  in 
such  a  way,  that  within  each  cluster  the  average  dissimi¬ 
larity  (distance  between  points  assigned  to  the  same  clus¬ 
ter  and  the  cluster  mean)  is  minimized. 

K 

C*  =  arg  min  ^  ^  ||x,-  -  xk  ||"  .  (4) 

k=l  C(i)=k 

For  a  set  of  points  S, 

xs  =  arg  min  ^  |x;  -  m ||2  ,  (5) 

m  ieS 


where  m  is  the  mean  point  of  the  set  S.  We  can  obtain  C* 
by  solving  the  enlarged  optimization  problem 

K 


(mn,:  Z  Z  IK-^I 

c>{mk  h  k=lC(i)=k 


(6) 


where  mk  is  the  mean  of  the  k'1'  cluster.  The  iterative  algo¬ 
rithm  for  solving  (5)  is  as  follows  [5]: 

K-means  algorithm: 


Step  1:  Have  an  arbitrary  assignment  C  of  points  into  the 
specified  number  of  clusters  K  (initialization  can 
be  done  by  assigning  the  i‘h  point  to  the  i  mod  K 
cluster).  Compute  the  mean  vector  for  each  clus¬ 
ter. 

Step  2:  For  the  given  assignment  C,  the  cluster  variance 
as  expressed  in  (5)  is  minimized  with  respect  to 
{mu...,mf  yielding  the  means  of  the  currently 
assigned  clusters  (5). 

Step  3:  Given  a  current  assignment  of  means  {m\,...,m¥J, 
(6)  is  minimized  by  assigning  each  point  to  the 
closest  cluster  mean.  That  is, 

II  II2 

C{i)  =  argmin pc,.  -  mk  I  (7) 

1  <k<K 

Step  4:  Iterate  step  2  and  3  until  the  cluster  memberships 
do  not  change. 


We  now  propose  a  method  that  utilizes  K-means  algo¬ 
rithm  in  order  to  construct  a  hierarchical  tree  structure. 
Using  K-means,  N  members  are  assigned  into  2  clusters. 
Since  K-means  does  not  guarantee  that  equal  number  of 
members  will  be  assigned  to  each  cluster  or  N  might  be 
odd,  a  refinement  procedure  for  assigning  equal  number 
of  members  to  each  cluster  takes  place.  This  refinement 


leads  to  a  construction  of  a  balanced  key  tree  when  N  =  2n 
and  forces  a  structure  as  close  to  the  balanced  as  possible 
otherwise.  At  every  following  step  all  clusters  are  further 
divided  into  two  new  clusters  and  the  refinement  proce¬ 
dure  for  each  pair  of  clusters  is  repeated.  The  algorithm 
terminates  when  we  have  created  clusters  of  two  members 
(after  log2-/V  splits  when  N  =  2n). 

Key  distribution  scheme  using  refined  K-means: 

Step  1:  Assign  all  points  to  an  initial  global  cluster. 

Step  2:  Divide  each  cluster  into  two  clusters  using  the  K- 
means  algorithm. 

Step  3:  Use  the  refinement  procedure  detailed  below  to 
balance  the  number  of  points  that  are  assigned  to 
each  cluster,  i.e.  assign  the  same  number  of 
points  to  each  cluster. 

Step  4:  Iterate  step  2  and  3  until  clusters  of  two  or  one 
points  have  been  created. 

Step  5:  Merge  single  points,  if  possible,  with  the  use  of 
K-means  for  only  single  points. 

Step  6:  Map  the  cluster  hierarchy  into  tree  hierarchy. 

If  cluster  Ci  has  |Ci|  points  and  cluster  C2  has  IC2I 
points  with  |Ci|>|C2|,  the  refinement  procedure  moves 

N|C1|  — |C2||/2j  points  from  Ci  to  C2.  The  criterion  by 

which  the  points  to  be  moved  are  selected,  is  the  mini¬ 
mum  Euclidean  distance  of  members  belonging  to  cluster 
Ci  from  the  point  expressing  the  mean  vector  mCi  of 

cluster  C2. 

Refinement  Procedure: 

Cmin  min{\Ci\,  IC2I},  Cm ax  /wux{|Ci|,  IC2I} 

for  k=l  to  |j|Cmax | - |Cmin |)/2j do 

find  i*  g  Cmax  such  that 


d(xf,mr  )  =  min 

'  *  min 


c 

'-'IT 


and  move  it  to  cluster  Cm 
recalculate  means  mc-  ,  mc^  . 


(8) 


Fig  4:  Application  of  refined  K-means  algorithm  to  an  eight- 
node  network. 


In  Fig.  4(a),  the  results  of  the  application  of  the  refined 
K-means  algorithm  for  an  eight-node  network  are  shown. 
Initially  two  clusters  are  constructed  with  centers  6),  G2 


containing  members  {Mn,  M8}  and  {Mu  M2,  M3,  M4,  M5, 
M6}.  After  the  application  of  the  refinement  procedure, 
members  M5  and  M6  are  moved  from  G2  to  G\  in  order  to 
form  two  clusters  of  equal  size.  A  second  application  of 
K-means  results  in  the  shown  final  four  clusters  and  the 
tree  in  Fig.  4(b)  is  built. 

5.  SIMULATION  RESULTS  AND  DISCUSSION 

In  this  section  we  present  the  results  of  the  application  of 
our  algorithm  in  a  large  number  of  simulation  experi¬ 
ments. 

Simulation  was  performed  in  randomly  generated  net¬ 
work  topologies  confined  in  a  10x10  square  grid  region. 
After  the  network  generation,  BIP  algorithm  [3]  was  ap¬ 
plied  in  order  to  provide  the  routing  tree.  The  routing  tree 
is  used  to  calculate  the  consumed  energy  for  transmitting 
to  each  member  or  group  of  members  from  the  GC  an 
updated  key.  In  our  network  model,  members  have  the 
ability  to  transmit  with  infinite  power  and  the  propaga¬ 
tion  loss  factor  is  set  to  y= 2.  Mobility  is  not  considered  in 
our  network  model. 

For  the  comparison  of  the  performance  of  our  algo¬ 
rithm,  10,000  key  distribution  trees  are  randomly  gener¬ 
ated.  The  energy  for  re-keying  the  key  tree  constructed 
from  the  refined  K-means  algorithm  is  compared  to  the 
minimum,  maximum,  median  energy  required  for  the 
update  of  the  10,000  trees.  The  size  of  the  multicast 
group  takes  different  values  N=  16,  32,  64,  128  plus  the 
GC. 

Network  topologies  of  32  nodes  Average  savings 


(a)  (b) 

Fig  5:  (a)  Performance  of  K-means  compared  to  10,000  random 
key  trees  for  10  different  network  topologies,  (b)  Average  sav¬ 
ings  using  the  K-means  algorithm  for  networks  with  different 
number  of  nodes. 

In  Fig.  5(a)  the  savings  for  10  different  randomly 
generated  network  topologies  are  shown.  In  Fig.  5(b)  the 
average  savings  are  shown.  The  savings  are  averaged 
over  100  different  network  topologies  for  each  value  of  A 
and  10,000  trees  are  used  for  comparison  for  each  topol¬ 
ogy. 

We  can  observe  that  refined  K-means  algorithm  sig¬ 
nificantly  out-performs  the  Logical  Key  Hierarchy  (LKH) 
[1]  key  distribution  scheme  in  energy  efficiency.  Refined 
K-means  procedure  provides  15%-3  7%  savings  from  the 
worst  possible  assignment,  15%-26%  savings  from  the 
median  case  and  does  almost  as  good  as  the  best  possible 
case  out  of  10,000  trees. 


However,  there  are  cases  where  the  application  of  our 
algorithm  results  in  higher  energy  expenditure,  compared 
to  a  randomly  generated  tree.  This  fact  reveals  the  non¬ 
optimality  of  our  algorithm.  There  are  cases  that  refined 
K-means  fails  to  accurately  exploit  the  broadcast 
advantage.  The  reason  for  this  failure  lies  in  the  omni¬ 
directionality  of  the  broadcast  advantage.  If  two  nodes  are 
almost  at  the  same  distance  from  the  GC,  but  in  an  oppo¬ 
site  direction,  refined  K-means  procedure  will  fail  to  cap¬ 
ture  the  benefit  from  clustering  them  together,  since  those 
nodes  will  not  be  spatially  correlated. 

The  performance  of  the  refined  K-means  algorithm  re¬ 
lies  also  on  the  homogeneity  of  the  environment  where 
the  ad-hoc  network  is  deployed.  If  the  path  loss  model  is 
identical  throughout  the  network  terrain,  then  the  physi¬ 
cal  location  of  the  nodes  is  directly  connected  with  the 
energy  spent  for  reaching  them.  Hence,  the  observation 
that  close  by  members  will  receive  information  through 
the  same  routing  paths  is  valid.  However,  when  the  envi¬ 
ronment  is  not  homogeneous  but  different  path  loss  mod¬ 
els  must  be  assumed  for  the  description  of  different  re¬ 
gions,  high  spatial  correlation  between  nodes  does  not 
necessarily  imply  the  use  of  similar  routing  paths  for  di¬ 
recting  traffic  to  those  nodes. 

Hence,  we  can  partition  the  area  where  the  network  is 
deployed,  by  identifying  regions  that  are  described  by  the 
same  path  loss  model.  This  partition  is  followed  by  the 
application  of  refined  K-means  to  each  region,  resulting 
in  the  construction  of  sub-clusters.  The  combination  of 
those  sub-clusters  according  to  an  energy  minimization 
criterion  can  provide  a  desired  key  distribution  scheme. 

6.  CONCLUSION 

We  showed  that  the  secure  multicast  in  ad-hoc  networks 
must  consider  the  physical  location  of  the  members  in 
order  to  be  energy-efficient.  In  particular,  we  showed  that 
the  results  [1]  do  not  generalize  to  ad-hoc  networks.  Re¬ 
cent  past  work  had  implied  this  generalization  was  feasi¬ 
ble  [2].  We  also  presented  an  energy-aware  key  distribu¬ 
tion  scheme  that  relies  on  the  spatial  correlation  of  the 
members. 
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